
US006665344B1 



<i2) United States Patent ao) Patent no.: US 6,665,344 Bi 

Fimoff (45) Date of Patent: Dec. 16, 2003 



(54) DOWNCONVERTING DECODER FOR 
INTERLACED PICTURES 

(75) Inventor: Mark Fimoff, Hoffman Estates, IL (US) 

(73) Assignee: Zenith Electronics Corporation, 

Lincolnshire, IL (US) 

( ♦ ) Notice: Subject to any disclaimer, the term of this 
patent is extended or adjusted under 35 
U.S.C 154(b) by 0 days. 

(21) Appl. No.: 09/106,367 

(22) Filed: Jun. 29, 1998 

(51) Int. CI. 7 H04N 7/12 

(52) U.S. Ci 375/240.2 

(58) Field of Search 348/402, 441; 

375/240; 358/133; 708/402 

(56) References Cited 

U.S. PATENT DOCUMENTS 

5,049,992 A 9/1991 Citta et al 358/140 

5,262,854 A * 11/1993 Ng 358/133 

5,633,682 A * 5/1997 Tahara 375/240.23 

5,654,910 A • 8/1997 Iwata 364/725 

5,805,483 A * 9/1998 Shim 708/402 

5,903,311 A * 5/1999 Ozcelik et al 375/240.2 

5,957,998 A * 9/1999 Ozaki 708/402 

6,025,878 A * 2/2000 Boycc ct al 348/402 

6,028,648 A • 2/2000 Yu 375/240.27 

6,184,935 Bl ♦ 2/2001 Iaquiato et al 248/441 

6,175,592 Bl * 6/2001 Kim et al 375/240 

FOREIGN PATENT DOCUMENTS 

EP 0 576 290 12/1993 H04N/7/137 

EP 0 707 426 4/1996 H04N/7/26 

EP 0 786 902 7/1997 H04N/7/30 

WO WO 98/41011 9/1998 H04N/5/44 

OTHER PUBLICATIONS 

Mokry, R., ct al., "Minimal Error Drift in Frequency Scal- 
ability for Motion-Compensated DCT Coding," IEEE 
Transactions On Circuits and Systems for Video Technol- 



ogy, Aug. 1994, vol. 4, No, 4, pp. 392-406. 

Johnson, A. W., et al., "Drift Minimisation In Frequency 
Scalable Coders Using Block Based Filtering," Applied 
Research & Development. 

Johnson, A. W., ct al., "Rational Value Decimation Using the 
DCT For Layered Coding," Monash University, Telecom 
Australia Research Laboratories, (1991). 

Vetro, A "Frequency Domain Down-Conversion of HDTV 
Using An Optimal Motion Compensation Scheme", Inter- 
national Journal of Imaging Systems and Technology, US, 
John Wiley & Sons, New York, vol. 9, No. 4, pp. 274-282, 
(Jan. 1, 1998). 

International Search Report, dated Mar. 4, 2000, Application 
No. PCT/US99/15254. 

* cited by examiner 

Primary Examiner — Chris Kelley 
Assistant Examiner — George A. Bugg 



(57) 



ABSTRACT 



Received frame and field coded DCT coefficient blocks are 
downcon verted to lower resolution reconstructed pixel field 
blocks so that, for example, HDTV programs can be played 
on NTSC receivers. The frame and field coded DCT coef- 
ficient blocks have motion vectors associated therewith. 
Specifically, the received frame coded DCT coefficient 
blocks are converted to converted field coded DCT coeffi- 
cient blocks and an IDCT module is performed on the 
converted field coded DCT coefficient blocks to produce 
downconverted field residual or pixel blocks. Also, an IDCT 
is performed directly on the received field coded DCT 
coefficient blocks to produce downconverted field residual 
or pixel blocks. Reference pixel blocks are selected based 
upon the motion vectors, the reference pixel blocks are 
upsampled, and the upsampled reference pixel blocks are 
downsampled. The upsampled and downsampled reference 
pixel blocks form predictions that are added to the field 
residual blocks in order to form reconstructed pixel field 
blocks. 

44 Claims, 13 Drawing Sheets 
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DOWNCONVERTING DECODER FOR 
INTERLACED PICTURES 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates to a downconverting 
decoder for downconverting and decoding high resolution 
encoded video for display by a lower resolution receiver. 

BACKGROUND OF THE INVENTION 

The international standard ISO/IEC 13818-2 (Generic 
Coding of Motion Pictures and Associated Audio Informa- 
tion: Video) and the "Guide to the use of the ATSC Digital 
Television Standard" describe a system, known as MPEG-2, 
for encoding and decoding digital video data. According to 
this system, digital video data is encoded as a series of code 
words in a complicated manner that causes the average 
length of the code words to be much smaller than would be 
the case if, for example, each pixel in every frame was coded 
as an eight bit value. This type of encoding is also known as 
data compression. 

The standard allows for encoding of video over a wide 
range of resolutions, including higher resolutions commonly 
known as HDTV. In MPEG-2, encoded pictures are made up 
of pixels. Each 8x8 array of pixels is known as a block, and 
a 2x2 array of blocks is known as a macroblock. Compres- 
sion is achieved by using well known techniques including 
(i) prediction (motion estimation in the encoder and motion 
compensation in the decoder), (ii) two dimensional discrete 
cosine transform (DCT) which is performed on 8x8 blocks 
of pixels, (iii) quantization of the resulting DCT coefficients, 
and (iv) Huffman and run/level coding. In MPEG-2 
encoding, pictures which are encoded without prediction are 
referred to as I pictures, pictures which are encoded with 
prediction from previous pictures are referred to as P 
pictures, and pictures which are encoded with prediction 
from both previous and subsequent pictures are referred to 
as B pictures. 

An MPEG-2 encoder 10 is shown in simplified form in 
FIG. 1. Data representing macroblocks of pixel values are 
fed to both a subtractor 12 and a motion estimator 14. In the 
case of P pictures and B pictures, the motion estimator 14 
compares each new macroblock (i.e., a macroblock to be 
encoded) with the macroblocks in a reference picture pre- 
viously stored in a reference picture memory 16. The motion 
estimator 14 finds the macroblock in the stored reference 
picture that most closely matches the new macroblock. 

The motion estimator 14 reads this matching macroblock 
(known as a predicted macroblock) out of the reference 
picture memory 16 and sends it to the subtractor 12 which 
subtracts it, on a pixel by pixel basis, from the new mac- 
roblock entering the MPEG-2 encoder 10. The output of the 
subtractor 12 is an error, or residual, that represents the 
difference between the predicted macroblock and the new 
macroblock being encoded. This residual is often very small. 
The residual is transformed from the spatial domain by a two 
dimensional DCT 18. The DCT residual coefficients result- 
ing from the two dimensional DCT 18 are then quantized by 
a quantization block 20 in a process that reduces the number 
of bits needed to represent each coefficient. Usually, many 
coefficients are effectively quantized to zero. The quantized 
DCT coefficients are Huffman and run/level coded by a 
coder 22 which further reduces the average number of bits 
per coefficient. 

The motion compensator 14 also calculates a motion 
vector (mv) which represents the horizontal and vertical 
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displacement of the predicted macroblock in the reference 
picture from the position of the new macroblock in the 
current picture being encoded. It should be noted that motion 
vectors may have V4 pixel resolution which is achieved by 

5 linear interpolation between adjacent pixels. The data 
encoded by the coder 22 are combined with the motion 
vector data from the motion estimator 14 and with other 
information (such as an indication of whether the picture is 
an I, P or B picture), and the combined data are transmitted 

io to a receiver that includes an MPEG-2 decoder 30. 

For the case of P pictures, the quantized DCT coefficients 
from the quantization block 20 are also supplied to an 
internal loop that represents the operation of the MPEG-2 
decoder 30. Within this internal loop, the residual from the 

35 quantization block 20 is inverse quantized by an inverse 
quantization block 24 and is inverse DCT transformed by an 
inverse discrete cosine transform (IDCT) block 26. The 
predicted macroblock, that is read out of the reference 
picture memory 16 and that is supplied to the subtractor 12, 

20 is also added back to the output of the IDCT block 26 on a 
pixel by pixel basis by an adder 28, and the result is stored 
back into the reference picture memory 16 in order to serve 
as a macroblock of a reference picture for predicting sub- 
sequent pictures. The object of this internal loop is to have 

25 the data in the reference picture memory 16 of the MPEG-2 
encoder 10 match the data in the reference picture memory 
of the MPEG-2 decoder 30. B pictures are not stored as 
reference pictures. 

In the case of I pictures, no motion estimation occurs and 
the negative input to the subtractor 12 is forced to zero. In 
this case, the quantized DCT coefficients provided by the 
two dimensional DCT 18 represent transformed pixel values 
rather than residual values, as is the case with P and B 
pictures. As in the case of P pictures, decoded I pictures are 

35 stored as reference pictures. 

The MPEG-2 decoder 30 illustrated in FIG. 2 is a sim- 
plified showing of an MPEG-2 decoder. The decoding 
process implemented by the MPEG-2 decoder 30 can be 

40 thought of as the reverse of the encoding process imple- 
mented by the MPEG-2 encoder 10. The received encoded 
data is Huffman and run/level decoded by a Huffman and 
run/level decoder 32. Motion vectors and other information 
are parsed from the data stream flowing through the Huff- 

45 man and run/level decoder 32. The motion vectors are fed to 
a motion compensator 34. Quantized DCT coefficients at the 
output of the Huffman and run/level decoder 32 are fed to an 
inverse quantization block 36 and then to an IDCT block 38 
which transforms the inverse quantized DCT coefficients 

50 back into the spatial domain. 

For P and B pictures, each motion vector is translated by 
the motion compensator 34 to a memory address in order to 
read a particular macroblock (predicted macroblock) out of 
a reference picture memory 42 which contains previously 

55 stored reference pictures. An adder 44 adds this predicted 
macroblock to the residual provided by the IDCT block 38 
in order to form reconstructed pixel data. For I pictures, 
there is no reference picture so that the prediction provided 
to the adder 44 is forced to zero. For I and P pictures, the 

w output of the adder 42 is fed back to the reference picture 
memory 42 to be stored as a reference picture for future 
predictions. 

The MPEG encoder 10 can encode sequences of progres- 
sive or interlaced pictures. For sequences of interlaced 
65 pictures, pictures may be encoded as field pictures or as 
frame pictures. For field pictures, one picture contains the 
odd lines of the raster, and the next picture contains the even 
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lines of the raster. All encoder and decoder processing is full resolution reference pictures. This difference means that 

done on fields. Thus, the DCT transform is performed on the prediction derived from the reduced resolution reference 

8x8 blocks that contain all odd or all even numbered lines. will differ by some amount from the corresponding predic- 

These blocks are referred to as field DCT coded blocks. tion made by the encoder, resulting in error in the residual 

Od the other hand, for frame pictures, each picture 5 plus prediction sum provided by the adder 58 (this error is 

contains both odd and even numbered lines of the raster. referred t0 herem . 15 P 1 *"*™ error > e ™,T 

j j -.i. increase as predictions arc made upon predictions until the 

Macroblocks of frame pictures are encoded as frames in the , - _ r , , \ . \ 

i_ . , , , . ,, , reference is refreshed by the next I picture. 

sense that an encoded macroblock contains both odd and . v 

even lines. However, the DCT performed on the four blocks . compensator 62 attempts to reduce this predic- 

within each macroblock of a frame picture may be done in w "° n e ™r b y ™8 «« fcu ^solution motion vector* even 

two different ways. Each of the four DCT transform blocks ^ th 5 reference picture is at lower resolution. First, a 

in a macroblock may contain both odd and even lines (frame portion of the reference picture that includes the predicted 

DCT coded blocks), or alternatively two of the four DCT m««*lock is read from the reference picture memory 60. 

blocks in a macroblock may contain only the odd Unes of the ^ P°" 10D 15 bas f? °° aU b,ts ° f vector 

macroblock and the other two blocks may contain only the « exce P l , the l Ci f "gnificant bit. This predicted macroblock is 

even Unes of the macroblock (field DCT coded blocks). The interpolated back to full resolution by a 2x2 prediction 

coding decision as to which way to encode a picture may be "psample filter 64 Using the full resolution motion vector 

made adaptively by the MPEG-2 encoder 10 based upon ( whl 1 ch . mav mclud , e * P™ 1 ^solution), a predicted full 

which method results in better data compression. resolution macroblock is extracted from the upsampled 

,, , . , , . „ . „ , , io portion based upon all of the bits of the motion vector. Then, 

Residual macroblocks in field pictures are field DCT , downsampler 66 forms a 2x2 downsampling on the 

coded and are predicted from a reference field. Residual m resohuion macroblock fa order to match the 

macroblocks m frame pictures that are frame DCT coded are resolution of the 4x4 1DCT outpul of the downsampler 56. 

predicted from a reference fram^Residual macroblocks m m mis w thc ^ iction from lbc refcrcncc icture 

frame pictures that are field DCT coded have two blocks m 6Q is lcd to match mc ^ resolution 

predicted from one reference field and two blocks predicted residual ^ aUowi |he ^ of m rcsolmion 

from either the same or other reference field. molion vectQre ^ the m reso , ution reference picture is 

For sequences of progressive pictures, all pictures are downsampled prior to addition by the adder 58 in order to 

frame pictures with frame DCT coding and frame prediction. matcb tbc resoluuon of thc downsampled residual from the 

MPEG-2, as described above, includes the encoding and 30 downsampler 56. 

decoding of video at high resolution (HDTV). In order to There are several known good prediction upsampling/ 

permit people to use their existing NTSC televisions in order downsampling methods that tend to minimize the prediction 

to view HDTV transmitted programs, it is desirable to error caU sed by upsampling reference pictures that have 

provide a decoder that decodes high resolution MPEG-2 been downsampled with a 4x4 IDCT. These methods typi- 

encoded data as reduced resolution video data for display on 35 ca u y involve use of a two dimensional filter having five to 

existing NTSC televisions. (Reducing the resolution of eighl taps and lap values that vary both with the motion 

television signals is often called down conversion vector value for the predicted macroblock and the position 

decoding.) Accordingly, such a downcon verting decoder 0 f lhe pixel interpolated within the predicted 

would allow the viewing of HDTV signals without requiring macroblock. Such a filter not only upsamples the reduced 

viewers to buy expensive HDTV displays. 40 resolution reference to full resolution and subsequently 

There arc known techniques for making such a downcon- downsamples in a single operation, but it can also include 

verting decoder such that it requires less circuitry and is, additional Vi pixel interpolation (when required due to a 

therefore, cheaper than a decoder that outputs full HDTV fractional motion vector). (See, for example, Minimal Error 

resolution. One of these methods is disclosed in U.S. Pat. Drift in Frequency Scalability for Motion Compensated 

No. 5,262,854. The down conversion technique disclosed 45 DCT Coding, Mokry and Anastassiou, IEEE Transactions 

there is explained herein in connection with a down con- on Circuits and Systems for Video Technology , August 1994, 

vertor 50 shown in FIG. 3. Thc down converter 50 includes and Drift Minimization in Frequency Scaleable Coders 

a Huffman and run/level decoder 52 and an inverse quanti- Using Block Based Filtering, Johnson and Princen, IEEE 

zation block 54 which operate as previously described in Workshop on Visual Signal Processing and Communication, 

connection with the Huffman and run/level decoder 32 and 50 Melbourne, Australia, September 1993.) The objective of 

the inverse quantization block 36 of FIG. 2. However, such upsampling and downsampling is for the prediction 

instead of utilizing the 8x8 IDCT block 38 as shown in FIG. upsampling filter to be a close spatial domain approximation 

2, the down convcrtor 50 employs a downsampler 56 which to the effective filtering operation done by a 4x4 IDCT. 

discards the forty-eight high order DCT coefficients of an The following example is representative of the prediction 

8x8 block and performs a 4x4 IDCT on the remaining 4x4 55 upsampling/downsampling filter described in the Mokry and 

array of DCT coefficients. This process is usually referred to Johnson papers. This example is a one dimensional example 

as DCT domain downsampling. The result of this down- but is easily extended to two dimensions. Let it be assumed 

sampling is effectively a filtered and downsampled 4x4 that pixels yl and pixels y2 as shown in FIG. 4 represent two 

block of residual samples (for P or B pictures) or pixels for a dj acent blocks in a downsampled reference picture , and that 

I pictures. 60 the desired predicted block stradles the boundary between 

For residual samples, a prediction is added by an adder 58 the two blocks. The pixels yl are upsampled to the pixels pi 

to the residual samples from the downsampler 56 in order to by using a four tap filter with a different set of tap values for 

produce a decoded reduced resolution 4x4 block of pixels. each of the eight calculated pixels pi. The pixels y2 are 

This block is saved in a reference picture memory 60 for likewise upsampled to the pixels p2 by using the same four 

subsequent predictions. Accordingly, predictions will be 65 tap filter arrangement. (If the motion vector requires Vi pixel 

made from a reduced resolution reference, while predictions interpolation, this interpolation is done using linear interpo- 

made in the decoder loop within the encoder are made from lation to calculate in between pixel values based on the 
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pixels pi and p2.) From these sixteen pixels pi and pixels 
p2, an upsampled prediction consisting of eight pixels q can 
be read using the full resolution motion vector. The pixels q 
are then filtered and downsampled to pixels q' by an eight tap 
filter with a different set of tap values for each of the four 5 
pixels q'. The Johnson paper teaches how to determine the 
optimum tap values for these filters given that the reference 
picture was downsampled by a four point IDCT. The tap 
points are optimum in the sense that the prediction error is 
minimized. The Johnson and Mokry papers also show that 10 
the upsampling, linear interpolation, and downsampling 
fillers can be combined into a single eight tap filter with tap 
values that depend on the motion vector value and the 
particular pixels q' being calculated. Accordingly, this single 
eight tap filter allows four pixels q* to be calculated directly 15 
from the eight pixels yl and y2. 

The down converter 50, while generally adequate for 
progressive pictures with frame DCT coded blocks, does not 
address problems that arise when attempting to down con- 
vert sequences of interlaced pictures with mixed frame and 20 
field DCT coded blocks. These problems arise, for the most 
part, with respect to vertical prediction upsampling, and are 
described below in a one dimensional vertical context. Thus, 
for the purpose of this description, a full resolution block 
refers to an eight pixel vertical column with a downsampled 25 
block having a corresponding vertical column of four pixels. 

Let it be assumed that an eight point vertical column of 
pixels as shown in column 70 of FIG. 5 is transformed into 
DCT coefficients by an encoder utilizing an eight point DCT 
transform operation. A down con verting decoder discards the 30 
four high order coefficients for each block and performs a 
four point IDCT on the remaining coefficients (DCT domain 
downsampling). The spatial relationship between the origi- 
nal pixels x and the decoded pixels y is shown by columns 
70 and 72. The pixels y represent the stored reference 35 
picture. 

Prediction upsampling/downsampling methods, such as 
those previously referenced (Mokry, Johnson), which oper- 
ate on DCT domain downsampled reference pictures, result 4Q 
in the spatial relationships shown in FIG. 6, where the 
reference pixels y are first upsampled to produce upsampled 
reference pixels p (these approximate the original pixels x) 
and the upsampled reference pixels p are then downsampled 
to produce downsampled reference pixels q. These methods 45 
attempt to effectively reverse the DCT domain downsam- 
pling with a minimal or small error due to the discarding of 
the high order DCT coefficients when the 4x4 IDCT is 
performed. The objective is for the prediction upsampling 
filter to be a close spatial domain approximation to the 5Q 
effective filtering operation done by a 4x4 IDCT. 

The typical operation of such a filter operating vertically 
is explained as follows. A portion of the lower resolution 
reference picture consisting of two pixel blocks (e.g., the yj 
and y 2 pixel blocks of column 80) overlapped by the desired 55 
predicted block is accessed. As shown in column 82, these 
two pixel blocks are upsampled and filtered to approximate 
the full resolution reference so that the pixels pj and p 2 
approximate full resolution pixels x. Then, the pixels p 2 and 
p 2 are filtered and downsampled to produce pixels q as 60 
shown in column 84. The pixels q form the predicted block 
that is supplied to the adder 58. 

This upsampling/downsampling process can either be a 
two step filtering process, or the pixels q can be directly 
calculated from the pixels y using, for example, an eight tap 65 
filter whose filter coefficients vary with the motion vector 
value and the particular pixels q being calculated, as 
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described in the Johnson and Mokry papers. As shown in 
FIG. 7, prediction upsampling/downsampling filters can also 
include additional Vi pixel interpolation (approximation of 
pixel values between original pixels x) when the motion 
vector is fractional. 

It is noted that the pixels y due to DCT domain down- 
sampling are effectively spatially located halfway between 
the original pixels x. This spatial relationship has important 
implications because, as previously explained, the DCT 
blocks may be frame or field encoded. For example, if it is 
assumed that a full resolution frame consisting of fields A 
and B is encoded by the encoder, and if these fields are field 
DCT encoded, the DCT domain downsampling must be 
performed by a down conversion decoder separately on each 
field block. The resulting vertical spatial relationship of 
pixels in the downsampled fields a and b with respect to 
pixels in the original fields A and B is shown in FIG. 8, 
where the original encoded fields A and B are shown in 
column 90 and the downsampled fields a and b are shown in 
column 92. It should be noted that the pixels b are not evenly 
spaced between the pixels a. 

On the other hand, with frame DCT encoding, pixels from 
fields A and B are combined together into DCT blocks by the 
encoder. DCT domain downsampling on these frame DCT 
coded blocks results in the pixel spatial relationship shown 
in FIG. 9, where the original frame DCT encoded fields A 
and B are shown in column 94 and the downsampled frame 
c is shown in column 96. It should be noted that the pixels 
c are evenly spaced. 

According to the MPEG-2 standard, a picture may be 
encoded with all macroblocks containing frame DCT coded 
blocks, with all macroblocks containing field DCT coded 
blocks, or with a mix of field and frame coded macroblocks. 
Therefore, performing DCT domain downsampling as 
shown by the prior art results in reference pictures that have 
a varying pixel structure. An entire reference picture may 
have the a/b structure shown in column 92 or the c structure 
shown in column 96. On the other hand, a reference picture 
may be composed of macroblocks, some having the a/b 
structure of column 92 and others having the c structure of 
column 96. 

When forming a predicted macroblock, as previously 
explained, the reference picture must be upsampled so that 
it matches its original full resolution structure. The predic- 
tion upsampling operation is made more complicated 
because the two different reference picture pixel structures 
shown in columns 92 and 96 require different upsampling 
processes. Because the pixels in the c structured reference 
picture shown in column 96 have resulted from DCT domain 
downsampling of a frame, prediction upsample filtering 
must be performed on the reference macroblock as a frame 
to derive the A and B fields together. However, because the 
pixels in the a/b structured reference pictures shown in 
column 92 have resulted from DCT domain downsampling 
of separate fields, prediction upsample filtering must be 
performed separately on each field (a and b) of the reference 
macroblock in order to derive the A and then B fields shown 
in column 90. 

A further complication is introduced when reference 
blocks have a mixed macroblock pixel structure because 
predicted macroblocks from the reference picture may 
straddle stored reference macroblocks, some having the c 
structure and some having the a/b structure. In this case, two 
different prediction upsample processes would have to be 
executed for different parts of the same predicted macrob- 
lock. 
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Moreover, a particular disadvantage of using the c struc- 
ture shown in column 96 for reference pictures becomes 
apparent when it is necessary to do field prediction from a 
c structured reference, where the A/B structured full reso- 
lution reference contains high vertical frequencies. For 
example, if il is assumed that at full resolution the A/B 
reference is entirely composed of alternating black (A field) 
and white (B field) lines, a c structured downsampled 
reference would be composed of pixels that are approxi- 
mately gray due to the mixing of the A and B pixels that 
occurs during filtering and downsampling. However, an a/b 
structured reference would have all black pixels for the a 
field and all white pixels for the b field because each field is 
filtered and downsampled separately. If the encoder decides 
to do a field prediction from the A field, a decoder with a c 
structured reference would read a prediction consisting of 
gray pixels. However, a decoder with an a/b structured 
reference would read a much more accurate prediction from 
the a field consisting of black pixels. Thus, the a/b structure 
avoids the field "mixing" in the decoder that occurs in the c 
structure. 

The downconvcrting decoder of the present invention 
overcomes one or more of the problems inherent in the prior 
art. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, a method of 
downconverting received frame and field coded DCT coef- 
ficient blocks to reconstructed pixel field blocks, wherein the 
frame and field coded DCT coefficient blocks have motion 
vectors associated therewith, comprises the following steps: 

a) converting the received frame coded DCT coefficient 
blocks to converted field coded DCT coefficient blocks and 
performing an 1DCT on the converted field coded DCT 
coefficient blocks to produce residual or pixel field blocks; 

b) directly performing an I DCT on the received field coded 
DCT coefficient blocks lo produce residual or pixel field 
blocks; c) selecting reference pixel blocks based upon the 
motion vectors, upsampling the reference pixel blocks, and 
downsampling at least a portion of the upsampled reference 
blocks to form a prediction; and, d) adding the prediction to 
the residual field blocks lo form reconstructed field blocks. 

In a more detailed aspect of the present invention, an 
apparatus for downconverting received frame and field 
coded DCT coefficient blocks to reconstructed pixel field 
blocks comprises an I DCT and a motion compensator. The 
IDCT is arranged to convert the received frame coded DCT 
coefficient blocks to converted field coded DCT coefficient 
blocks and to perform an IDCT on the converted field coded 
DCT coefficient blocks and on the received field coded DCT 
coefficient blocks in order to produce downconverted pixel 
related field blocks. The motion compensator is arranged to 
apply motion compensation, as appropriate, to the down- 
converted pixel related field blocks in order to produce the 
reconstructed pixel field blocks. 

In a further more detailed aspect of the present invention, 
an apparatus for downconverting received frame and field 
coded DCT coefficient blocks to downconverted pixel 
related field blocks comprises first and second IDCT's. The 
first IDCT is arranged to convert the received frame coded 
DCT coefficient blocks to converted field coded DCT coef- 
ficient blocks and to perform a downconverting I DCT on the 
converted field coded DCT coefficient blocks in order to 
produce first downconverted pixel related field blocks. The 
second IDCT is arranged to directly perform a downcon- 
verting IDCT on the received field coded DCT coefficient 
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blocks in order to produce second downconverted pixel 
related field blocks. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 These and other features and advantages of the present 
invention will become more apparent from a detailed con- 
sideration of the invention when taken in conjunction with 
the drawings in which: 

1Q FIG. 1 is a simplified block diagram of a known MPEG-2 
encoder; 

FIG. 2 is a simplified block diagram of a known MPEG-2 
decoder; 

FIG. 3 is a block diagram of a known down conversion 
15 decoder for an HDTV application; 

FIGS. 4—9 show exemplary sets of pixel data useful in 
describing the background of the present invention; 

FIG. 10 is a block diagram of a downconverting decoder 
2o according to the present invention; 

FIGS. 11-14 show additional exemplary sets of pixel data 
useful in describing the present invention; 

FIG. 15 shows a block diagram of a motion compensator 
of FIG. 9 in additional detail; 

25 FIG. 16 shows an additional exemplary set of pixel data 
useful in describing the present invention; 

FIG. 17 shows an embodiment of an IDCT module of 
FIG. 9; and, 

3Q FIGS. 18-20 show an exemplary set of pixel data useful 
in describing an alternative embodiment of the present 
invention. 

DETAILED DESCRIPTION 

35 According to one embodiment of the present invention, 
reference pictures are stored with a predetermined vertical 
pixel structure regardless of whether the received DCT 
blocks are field or frame coded. For example, the reference 
pictures are always stored with the a/b vertical pixel struc- 

40 ture shown in column 92 regardless of whether the received 
DCT blocks are field or frame coded. This consistent vertical 
pixel structure for the reference pictures allows both field 
and frame prediction with prediction upsampling to be done 
in a more straight forward manner because the pixel struc- 

45 ture of the reference picture is always known. 

A downconverting decoder 100 according to an embodi- 
ment of the present invention is shown in FIG. 10. A 
Huffman and run/level decoder, data parser, and inverse 
quantizer (not shown), which are located upstream of the 

50 downconverting decoder 100, all operate as described 
above. The resulting DCT coefficient blocks from the Huff- 
man and run/level decoder and the inverse quantizer are 
provided to an IDCT module 102 of the downconverting 
decoder 100. Other information, such as DCT type (frame or 

55 field) and picture type (frame or field) are provided to the 
IDCT module 102 from the data parser. The data parser also 
provides the motion vector, prediction type (frame or field), 
and field select signals to a motion compensator 104. 
The IDCT module 102 converts frame DCT coded blocks 

60 to field DCT coded blocks and performs DCT domain 
filtering, downsampling, and inverse DCT. In the case of 
field DCT coded blocks, the IDCT module 102 merely 
performs DCT domain filtering, downsampling, and inverse 
DCT. The residual pixel and intra pixel output of the IDCT 

65 module 102 has an a or b field structure and is fed to an adder 
106. For the case of residual pixels, the other input of the 
adder 106 receives predicted pixels from the motion com- 
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pensator 104. The output of the adder 106 consists of 
reconstructed blocks of field pixels which are provided to an 
interpolation module 108 and to a reference picture memory 
110. The interpolation module 108 does a position adjust on 
the b field pixels for correct display. The field pixel blocks 
(for 1 and P pictures) are stored in the reference picture 
memory 110 for future predictions. The motion compensator 
104 reads predicted pixel blocks from the reference picture 
memory 110 as required. 

When frame pictures are received, the macroblocks of the 
frame pictures may be field or frame DCT coded. If a 
macroblock is frame DCT coded as indicated by the DCT 
type signal, the 1DCT module 102 will convert the first two 
vertically stacked frame DCT coded blocks to two field DCT 
coded blocks and then perform a 4x4 IDCT on each of the 
two blocks. This process will be described below in greater 
detail and it will be shown that this process can be done in 
a very efficient manner. The same process is performed on 
the next two vertically stacked blocks in that macroblock. 
The result is as if the macroblock was originally field coded. 
However, if the macroblock is field DCT coded to begin 
with as indicated by the DCT type signal, then the IDCT 
module 102 just performs 4x4 IDCTs on each block in that 
macroblock. The result is that the output of the IDCT 
module 102 is always an a/b structured macroblock as 
shown in column 92 because the c structure shown in 
column 96 is avoided. Therefore, reference pictures will 
always be stored with the a/b pixel structure, which simpli- 
fies prediction upsampling. 

When field pictures are received, the macroblocks of the 
field pictures input to the IDCT module 102 will always be 
field coded. The output of the IDCT module 102 will be 
either an a or a b structured macroblock. 

As explained above in the discussion of prior art, the 
motion compensator with prediction upsampler/ 
downsampler filter should use the motion vector to select an 
area from the reference picture memory that includes the 
blocks overlapped by the desired predicted macroblock. 
This area is then upsampled to full resolution (including Yi 
pixel interpolation if required). Then the motion vector is 
used to select the full resolution predicted macroblock. The 
full resolution predicted macroblock is then filtered and 
downsampled to match the structure of the 4x4 IDCT 
residual output. This process can be done in three steps, or 
can be implemented as a single step. Horizontally, this 
process is the same as in the prior art. Vertically, the motion 
compensator 104 must support two types of prediction 
upsampling/downsampling. These two types are field pre- 
diction and frame prediction, and are indicated by the 
prediction type signal. 

Field Prediction — When the prediction type signal indi- 
cates field such that field prediction from an a or b reference 
field is required, the motion vector is used to read pixels 
from a particular area of a downsampled reference field a or 
b. Then, a prediction upsampling/downsampling filter 
(Mokry or Johnson or similar type) operates horizontally 
and then vertically on these pixels to form a predicted field 
macroblock. This operation is shown in FIG. 11 with respect 
to a downsampled reference field a. Column 120 of FIG. 11 
represents a DCT domain downsampled reference field a as 
stored in the reference picture memory 110. Column 122 
represents an upsampled portion of this reference field, and 
column 124 represents the prediction after downsampling. 

The upsampling/downsampling filler may include addi- 
tional V5 pixel interpolation if the motion vector is fractional. 
FIG. 12 shows an example of vertical upsampling/ 
downsampling from a reference field a when Vi pixel inter- 
polation is implemented. 

It should be noted that, in the case of frame pictures, there 
will be two field predictions for each macroblock utilizing 
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two motion vectors. One field prediction is for the A field, 
and one prediction is for the B field. Each field prediction is 
operated upon separately by the prediction upsampling/ 
downsampling filter, including additional V£ pixel interpo- 

5 lation if the motion vector is fractional. 

Frame Prediction — Frame prediction is required from an 
a/b reference picture for a frame DCT coded residual mac- 
roblock. Due to operation of the IDCT module 102, the 
residual macroblock output from the IDCT module 102 has 
the a/b structure. The motion vector for the macroblock is 
used to read a particular area from the a/b structured 
downsampled reference picture stored in the reference pic- 
ture memory 110. Horizontal prediction upsampling/ 
downsampling is done as in the case of field prediction. 
Vertical prediction upsampling/downsampling must be bro- 

15 ken up into separate steps. The a and b fields from the read 
area are separately vertically upsampled (here, both V£ pixel 
interpolation and subsequent downsampling are postponed 
to a later step) in order to approximate the full resolution A 
and B fields which are shown by steps 1 and 2 of FIG. 13. 

20 If the motion vector is fractional, Vt pixel interpolation is 
performed as follows. The A and B upsampled field areas are 
line shuffled (see step 3 of FIG. 14). Then Yi pixel vertical 
interpolation is performed as shown by step 4 of FIG. 14 in 
order to create A'/B' upsampled field areas. This process 

25 matches how the encoder does Yi pixel interpolation on 
frame coded macroblocks. It would not be correct to perform 
Yi pixel interpolation separately on the fields. Finally, the A 
and B (or A' and B 1 ) portions of the upsampled prediction are 
separately filtered and downsampled to create a prediction 
for the a/b structured residual macroblock (see step 5 of FIG. 
14). It should be noted that, unlike the situation for field 
prediction, the frame prediction upsample/downsample pro- 
cess must be split into separate filtering steps. 

The motion compensator 104 is shown in more detail in 
FIG. 15. The motion compensator 104 executes the two 

35 required types of prediction upsampling/downsampling 
based on the prediction type signal (field or frame). The 
prediction type signal is distributed to a motion vector 
translator 200, to a vertical upsample/downsample filler 202, 
to a combiner 204, to an interpolator 206, and to a vertical 

40 filter and downsampler 208. The motion vector translator 
200 also accepts the motion vector from the data parser as 
an input. The motion vector translator 200 translates the 
motion vector to a series of reference memory addresses in 
order to read pixels stored in the reference picture memory 

45 110 from reference fields a and b when the prediction type 
is for frame prediction, or from a reference field a or a 
reference field b (as indicated by the field select signal) when 
the prediction type indicates field prediction. 
The pixels that are read out from the reference picture 

5Q memory 110 are provided to a horizontal upsample/ 
downsample filter 210. The horizontal upsample/ 
downsample filter 210 executes a horizontal prediction 
upsampling/downsampling filtering operation as previously 
described, including Yi pixel interpolation if the motion 
vector is fractional. The horizontally processed pixels are 

55 then provided to the vertical upsample/downsample filter 
202. If the type of prediction is field prediction, the vertical 
upsample/downsample filter 202 executes vertical 
upsampling/downsampling, with additional Yi pixel interpo- 
lation if the motion vector is fractional. The combiner 204, 

60 the interpolator 206, and the vertical filter and downsampler 
208 simply pass through the output of the vertical upsample/ 
downsample filter 202 during field prediction. 

For frame prediction, however, the vertical upsample/ 
downsample filter 202 performs only vertical upsampling on 

65 the horizontally processed pixels provided by the horizontal 
upsample/downsample filler 210. In this case, Yi pixel pre- 
diction (if necessary) and downsampling are executed later. 
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The combiner 204 operates only during frame prediction and 
when the motion vector is vertically fractional by line 
shuffling the upsampled A and B field blocks. If the predic- 
tion type is frame and if the motion vector is vertically 
fractional, the interpolator 206 executes Vi pixel linear 5 
vertical interpolation. If the prediction type is frame, the 
vertical filter and downsampler 208 separately filters and 
downsamples the A and B (or A' and B*) fields. The output 
of the vertical filter and downsampler 208 is the desired 
prediction. Accordingly, the combiner 204 and the interpo- io 
lator 206 operate only when there is frame prediction and the 
motion vector is fractional, and the vertical filter and down- 
sampler 208 operates only when there is frame prediction. 
When there is field prediction, the combiner 204, the inter- 
polator 206, and the vertical filter and downsampler 208 15 
simply pass through the output of the vertical upsample/ 
downsample filter 202. 

The b field pixels at the output of the adder 106 of FIG. 
10 do not fall directly in between the a field pixels, as shown 
by column 220 of FIG. 16. Accordingly, the interpolation 20 
module 108 performs a simple linear interpolation, as shown 
in FIG. 16, so that the b field pixels are repositioned halfway 
in between the a field pixels, as shown in column 224. 
Accordingly, the b field pixels will be properly displayed. 



where the left band side of equation (1) comprises two one 
dimensional vertical blocks each containing eight frame 
DCT coefficients. An 8x8 DCT matrix [T8] is defined by the 
following equation: 
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where the rows of the right hand side of equation (2) contain 
the well known values for the eight point DCT basis vectors. 
Further, an IDCT operator [IT8 2 ] (where IT represents 
inverse transform) for two vertically stacked blocks is 



tion: 
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(3) 



Alternatively, a longer FIR filter with an even number of 25 derived from equation (2) according to the following equa- 
taps could be used. 

The IDCT module 102 of FIG. 10 is described in addi- 
tional detail subsequently to the following mathematical 
derivation. This mathematical derivation discloses an effi- 
cient method of converting two vertically stacked frame 
coded 8x8 blocks of DCT coefficients into two blocks of 
field DCT coefficients which are then vertically inverse 
discrete cosine transformed by a four point IDCT operation 
that effectively vertically filters and downsamples the blocks 
as separate fields resulting in an a/b spatial pixel structure. 
This mathematical derivation demonstrates that the vertical 
processing can be done efficiently in a single matrix opera- 
tion. This derivation is first shown one dimensionally for a 
single column of sixteen pixels (two vertically stacked one 
dimensional "blocks") and is then shown for two dimen- 
sional blocks of multiple columns. With respect to this 
derivation, an uppercase X refers to frequency domain DCT 
coefficients and a lowercase x or y refers to spatial domain 
pixels or residual values. 

First, the following equation provides an initial definition: 
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Equations (1) and (3) can be combined as follows: 
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where the two blocks [x] on the right hand side of equation 
(4) are in the spatial domain. Next, a matrix [T^.] may be 
65 defined, based on equation (2), according to the following 
equation: 
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where the rows of equation (5) are the eight point DCT basis 
vectors with zero's placed between each coefficient. 

Equations (4) and (5) can be combined according to the 
following equation: 
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where [x] is frame ordered, but where [Xab] comprises two 
field DCT coded blocks for fields a and b. 

Accordingly, frame DCT coefficients [X] can be con- 
verted to field DCT coded coefficients [Xab] in a single 50 
operation according to the following equation: 

IXabHT^xHTjlITSaXHQiIX] (7) 

where [QJ is an operator given by the following equation: 55 

I&HT-jimy (8) 

The frame DCT coded coefficients [X] can be convened 
to two separate field blocks [xab] in the spatial domain 60 
according to the following equation: 



where [xab] comprises two separate field blocks. 

The matrix [IT8 2 ] can be modified so that downsampling 
and filtering can be added. Thus, an 8x8 matrix [P4] can be 
formed from a 4x4 DCT basis matrix [T4] by padding the 
well known four point DCT basis vectors with zero's in both 
direction according to the following equations: 



|74] = 



(9) 
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(11) 



(12) 



I* 



where Q 2 is a 16x16 matrix operator according to the 65 

following expression: [Q a ]^rT8 3 ]T / .][JT82]. The quantity Then, the transpose of equation (12) may be applied to a 
[xab] is also given by the following equation: column of eight pixels according to the following equation: 
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0 


X 




0 



(13) 



where the y's on the right hand side of equation (13) are the 
filtered and downsampled spatial pixels resulting from the 
inverse DCT operation on the left hand side of equation 13. 
Only the four pixels y need to be retained such that the zero's 
on the right hand side of equation (13) may be discarded. 
These pixels y represent a filtered and downsampled version 
of an original eight pixel block [x]. A matrix [IP4J, which 
is an operator for two vertical blocks, may be established 
according to the following equation: 



[tp-h] = 



P4 i 



0 

P4 T 



The zero's on the right hand of equation (16) can be 
discarded so that only the ya's and yb's arc retained. The 
output, therefore, of applying the operator [Q 3 ] to two frame 
DCT coded blocks is separate top and bottom field blocks 
for fields a and b, each separately filtered and downsampled 
effectively using a 4x4 I DCT operating on field DCT coded 
blocks. 



20 



(14) 



25 



where the matrix is a filter/downsample/IDCT opera- 
tor. The operator [IT8 2 ] which is used to perform an IDCT 
on [X] to produce [x], the operator [T^] which is used to 
perform a field split in order to produce [Xab], and the 
operator [IP4J which performs filtering, downsampling, 30 
and an IDCT separately on each field to derive [y] can be 
combined according to the following equation so that two 
vertical blocks of frame DCT coded coefficients can be 
filtered, downsampled, and inverse discreet cosine trans- 
formed in a single operation: 35 

(CH'^IT^/TSJ (15) 

Thus, by applying the operator [Q 3 ], which is a 16x16 
operator, to two vertical blocks of DCT coded coefficients 
given by equation (1), the following results are produced: 40 



ya]] (16) 
ya 
ya 
ya 
0 
0 
0 
0 

yb 
yb 
yb 
yb 
0 
0 
0 
0 
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Because [Q 3 ] has the following form: 



the operator [Q 3 ] can be rewritten in the following form: 



(18) 



where [p] and [q] are each 4x16 matrices. Accordingly, the 
operator [Q 3 ] becomes an 8x16 operator instead of a 16x16 
operator so that, when it is applied to two frame DCT coded 
blocks, only the ya's and yb's of equation (16) results. 

The following description shows how the operator [Q 3 ] 
may be used on two dimensional blocks having eight 
columns, and also shows the horizontal IDCT with filtering 
and downsampling. For a full two dimensional field based 
4x4 IDCT of frame DCT coded blocks, the [Q 3 ] operation 
is only performed vertically. A standard four point IDCT is 
performed horizontally. Let X be a macroblock consisting of 
four frame coded 8x8 DCT blocks X 1( X^ X 3 , and X 4 . 
Spatially, these 8x8 DCT coded blocks are oriented as 
follows: 

x 3 ' x 4 



As a first step, the high order horizontal coefficients of 
each of the 8x8 DCT coded blocks X |f X 2 , X 3 , and X< are 
discarded (the four high order coefficients in each row) so 
that each DCT block becomes an 8x4 block (X^, X 2 ', X 3 \ 
and Xy). A 16x4 matrix [X m ] may be used to define the two 
8x4 frame coded DCT blocks and X 3 ' according to the 
following equation: 



I*.'] 1 
!*)'] I 



(19) 



and similarly a 16x4 matrix [XJ may be used to define the 
4 frame coded 8x4 DCT blocks X 2 ' and X 4 ' according to the 
following equation: 



[*„] = 



50 



(20) 



At this point, the vertical operation can be performed first, 
followed by the horizontal operation, or the horizontal 
operation may be performed first followed by the vertical 
55 operation. Assuming that the vertical operation is performed 
first; field based DCT domain vertical filtering and down- 
sampling is performed according to the following equation: 



(CI 
[<*] 



(21) 



where [G] is an 8x8 matrix, [Q 3 ] is an 8x16 matrix, [X m ] is 
the 16x4 matrix described above, and [GJ and [G b ] are 4x4 
65 matrices for corresponding fields a and b. Then, horizontal 
DCT domain filtering and downsampling is performed 
according to the following equation: 
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bo] 



[Ga\ 
[Gb] 



[T4\ 



where [ya] and [yb] arc each a 4x4 matrix, where [Ga] and 
[Gb] are the 4x4 matrices derived from equation (21), where 
the ya and yb are the resulting residual or pixel values 
provided by the IDCT module 102 to the adder 106, and 
where [T4] is the four point DCT basis vector 4x4 matrix 
from equation (11). These operations represented by equa- 
tions (21) and (22) are also applied thereafter to the 16x4 
array [XJ 

On the other hand, the order of the above operations can 
be reversed in the case of first performing horizontal 
downsampling, followed by vertical downsampling with the 
same results. Thus, the four point DCT basis vector matrix 
T4 is applied to the 8x4 array [X/] and the 8x4 array [X 3 ] 
according to the following equations: 



The results [Gj and [G 2 ] can be combined accordiog to the 
following equation: 



-IK 



Thereafter, a vertical field based operation may be per- 
formed according to the following equation: 



bb) 



= [QS]\ 



[<m] 
(Gil 



of the blocks provided to it. The vertically processed 4x4 
(22) blocks are fed to a selector 308. Because the DCT type is 
field, the 4x4 blocks provided by the matrix multiplier 306 
are selected for output to a matrix multiplier 310. The matrix 
5 multiplier 310 executes a four point horizontal IDCT on 
each row of the blocks supplied to it. The output of the 
matrix multiplier 310 is a 4x4 block of filtered and down- 
sampled pixels for field a or b. 

If a macroblock X consists of frame DCT coded blocks as 
10 signaled by the DCT type, the blocks of the macroblock are 
fed in turn by the parser 302 to a discard module 312 which 
discards the high order horizontal coefficients in order to 
form corresponding 8x4 frame DCT coded blocks. These 
8x4 blocks X,', Xj', X 3 ', and X,,' are provided to a reorder 
1 5 module 314 where these 8x4 blocks are stored and reordered 
and then provided as two 16x4 blocks X^, (X^ and X 3 ') and 
X„ (X 2 " and X 4 '). The 16x4 block X„ is first fed to a matrix 
multiplier 316 which, using the Q 3 operator, applies vertical 
conversion, filtering, and downsampling to the field DCT 
20 coefficients. The output of the matrix multiplier 316 is an 
8x4 block consisting of Ga and Gb. A store and delay 
module 318 then separately outputs the 4x4 blocks Ga and 
Gb to the selector 308. Because the current macroblock X 
being processed is of type frame, the selector 308 applies the 
25 4x4 Ga and Gb blocks to the matrix multiplier 310 which, 
operating first on the 4x4 block Ga, executes a four point 
IDCT on each row of the block, and then subsequently 
processes the 4x4 block Gb in a similar fashion. The outputs 
of the matrix multiplier 310 are 4x4 blocks of filtered and 
30 downsampled pixels ya for the a field and yb for the b field. 
The IDCT module 102 can be modified in order to permit 
the elimination of the interpolation module 108. That is, the 
four point IDCT previously described involves the use of 
four point DCT basis vectors (the [T4] operator) on the four 
35 low order DCT coefficients originally derived from an eight 
(26) P°i Qt DCT operation in the encoder. This process results in 
the "in between" downsampled pixel positioning as previ- 
ously described and as replicated in FIG. 18. In order to 



(23) 



(24) 



(25) 



permit the elimination of the interpolation module 108, an 
FIG. 17 shows an exemplary hardware implementation of 40 alternative operator [T4'] can be derived. 



an IDCT 300 which can be used for the IDCT module 102 
of FIG. 10. The IDCT 300 processes both field DCT and 
frame DCT coded macroblocks. The output of the I DCT 300 
is always a or b structured pixels. For field DCT coded 
blocks, the usual 4x4 IDCT is executed to achieve horizontal 
and vertical filtering and downsampling. For frame DCT 
coded blocks, the horizontal processing is the same, but 
vertically the Q 3 operator is used to effectively convert the 
frame coding to field coding, and then to filter and down- 
sample each field separately. The result is a or b structured 
field blocks in either case. 

It is noted here that matrix multiplication is not commu- 
tative (a*b does not in general equal b*a). Therefore, each 
matrix multiplier of the IDCT 300 has a pre and a post input 
to indicate the order of the associated matrix multiplication. 

A macroblock X consisting of four 8x8 DCT blocks X lt 
X^ X 3 , and X 4 is received by a parser 302 which sends the 
DCT blocks along one path of the IDCT 300 if the DCT 
blocks arc field coded blocks and sends the DCT blocks 
along a different path of the IDCT 300 if the DCT blocks are 
frame coded blocks. If the four 8x8 DCT blocks are field 
DCT coded blocks as signaled by a DCT type input to the 
parser 302, each block is fed to a discard module 304 which 
discards the high order horizontal and vertical coefficients in 
order to form a 4x4 field DCT coded block for each of the 
8x8 DCT field coded blocks supplied to it. A matrix multi- 
plier 306 executes a four point vertical IDCT on the columns 



45 



50 



55 



60 



65 



Thus, instead of consisting of the four point DCT basis 
vectors, the rows of the alternative operator [T4'] consist of 
downsampled eight point DCT basis vectors. If each row of 
the alternative operator [T4'] contains the second sample, the 
fourth sample, the sixth sample, and the eighth sample, 
respectively, of the four low order eight point DCT basis 
vectors, and if the decoder discards the four high order 
coefficients for each block and performs a four point IDCT 
on the remaining coefficients using the alternative operator 
[T4'], then the spatial relationship between the original 
pixels (x's) and the decoded pixels (y's) is shown in FIG. 19. 
Thus, if the [T4] operator is used to downsample the A field, 
and the alternative operator [T4 1 ] is used to downsample the 
B field, then the resulting spatial relationship is shown in 
FIG. 20. It is noted that this is the desired relationship as 
shown by the column 224 of FIG. 16. 

Therefore, it is desired to modify the IDCT module 102 so 
that its output always has the a/b pixel structure of FIG. 20. 
For field coded macroblocks of the A field, a four point 
IDCT (using the operator [T4]) is performed, as before. But 
for field coded macroblocks of the B field, the alternative 
operator [T4 1 ] derived from the eight point basis vectors is 
used for IDCT. 

For frame coded macroblocks, a modified operator, simi- 
lar to the operator Q 3 must be derived that computes two 
separately downsampled fields directly from the frame 
coded coefficients. The modified operator Q 3 ' must incor- 
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porale the operator [T4] for the A field and the alternative 
operator [T4'] for the B field. The operator [T4] is given by 
equation (11) and the operator [P4] is given by equation 
(12), where the DCT basis vectors rOO, rOl, . . . , r33 are four 
point DCT basis vectors. The alternative operators [T4'] and 
[P4'] may be given by the following equations: 
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(27) 



(28) 



where the DCT basis vectors tOl, t03, . . . t37 are eight point 
DCT basis vectors. The alternative operator [P4'] does not 
need the ^1 scaling factor which is incorporated into the 
operator [P4] because the alternative operator [P4 1 ] uses 
eight point basis vectors. Accordingly, the alternative opera- 
tor [IP4' 2 ] is given by the following equation: 



t/P^l = 



\P4 T 



0 

r* 7 



(29) 



where the operator [IP4' 2 ] is a filter/do wnsample/IDCT 
operator which clearly operates differently on the A field 
(using P4 7 ) than on the B field (using P4 ,? ). 

A modified operator Q 3 ', therefore, is defined according to 
the following equation: 



(30) 



This modified operator Q 3 ' is used by the 1 DCT module 102 
instead of the operator Q 3 previously described. The IDCT 
module 102 using these appropriate operators results in an 
a/b pixel structure that eliminates the need for the interpo- 
lation module 108. 

As previously stated, the use of a prediction upsampling 
filter should result in a close spatial domain approximation 
to the effective filtering operation performed by a 4x4 IDCT. 
Because the two fields are now filtered differently using the 
4x4 IDCT, the two fields must be filtered differently by the 
motion compensator 104. The basic filler structure is the 
same for both fields, only the tap values differ. The tap 
values for both fields can easily be derived and stored in 
memory. 

The following are representative values for the matrices 
[T8} and [T4]: 
[T8]- 

03536 0.3536 0.3536 0.3536 0.3536 0.3536 0.3536 0.3536 
0.4904 0.4157 0.2778 0.0975 -0.0975 -0.2778 -0.4157 
-0.4904 

0.4619 0.1913 -0.1913 -0.4619 -0.4619 -0.1913 0.1913 
0.4619 

0.4157 -0.0975 -0.4904 -0.2778 0.2778 0.4904 0.0975 
-0.4157 

03536 -0.3536 -0.3536 0.3536 0.3536 -03536 -0.3536 
0.3536 



0.2778 -0.4904 0.0975 0.4157 -0.4157 -0.0975 0.4904 
-0.277S 

0.1913 -0.4619 0.4619 -0.1913 -0.1913 0.4619 -0.4619 

0. 1913 

0.0975 -0.2778 0.4157 -0.4904 0.4904 -0.4157 0.2778 

-0.0975 
[T4]- 

0.5000 0.5000 0.5000 0.5000 
0.6533 0.2706 -0.2706 -0.6533 
' 0.5000 -0.5000 -0.5000 0.5000 
0.2706 -0.6533 0.6533 -0.2706 

However, it should be understood that these values are 
merely exemplary and other values could be used. 
15 Certain modifications of the present invention have been 
discussed above. Other modifications will occur to those 
practicing in the art of the present invention. For example, 
according to the description above, the interpolation module 
108 can be eliminated through derivation of an alternative 
20 operator [T4']. 

Accordingly, (he description of the present invention is to 
be construed as illustrative only and is for the purpose of 
teaching those skilled in the art the best mode of carrying out 
the invention. The details may be varied substantially with- 
out departing from the spirit of the invention, and the 
exclusive use of all modifications which are within the scope 
of the appended claims is reserved. 
What is claimed is: 

1. A method of downconverting received frame and field 
coded DCT coefficient blocks to reconstructed pixel field 
blocks, wherein the frame and field coded DCT coefficient 
blocks have motion vectors associated therewith, the method 
comprising the following steps: 

a) converting the received frame coded DCT coefficient 
blocks to converted field coded DCT coefficient blocks 
and performing an IDCT on the converted field coded 
DCT coefficient blocks to produce residual or pixel 
field blocks; 

b) directly performing an IDCT on the received field 
coded DCT coefficient blocks to produce residual or 
pixel field blocks; 

c) selecting reference pixel blocks based upon the motion 
vectors, upsampling the reference pixel blocks, and 
downsampling at least a portion of the upsampled 
reference blocks to form a prediction; and, 

d) adding ihe prediction to the residual field blocks to 
form reconstructed field blocks. 

2. The method of claim 1 wherein step a) comprises the 
following steps: 

al) dropping high order DCT coefficients from the frame 
coded DCT coefficient blocks to produce downsized 
frame coded DCT coefficient blocks; 
a2) reordering the downsized frame coded DCT coeffi- 
cient blocks; 

a3) converting the downsized frame coded DCT coeffi- 
cient blocks to resulting field coded DCT coefficient 
blocks and performing an IDCT by columns on the 
resulting field coded DCT coefficient blocks; and, 

a4) separately performing an IDCT by rows on results 
from step a3). 

3. The method of claim 2 wherein step a3) comprises the 
step of multiplying the resulting field coded DCT coefficient 
blocks by [03], wherein [Q3]-[IP4 2 ][T / IlT8 2 ], wherein 
[IP4 2 ] is in the following form: 
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f P4 T 0 
0 P4 T 



wherein [P4] is in the following form: 



rOO rOl r02 r03 0 0 

riO rll rl2 r!3 0 0 

r20 rll rll r23 0 0 

r30 r31 r32 r33 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 



0 0 

0 0 

0 0 

0 0 



/vr 



wherein rOO, rOl, . . . r33 are four point DCT basis vector, 
wherein [Ty] is in the following form: 



\T4] = 



rOO rOl r02 r03 

rlO rll rll rl3 

rlQ r2l rll r!3 

r30 r3l r32 r33 



5. The method of claim 1 wherein step b) comprises the 
following steps: 

io bl) dropping high order DCT coefficients from the field 

coded DCT coefficient blocks to produced downsized 

field coded DCT coefficient blocks; 
b2) performing an I DCT by columns on the downsized 

field coded DCT coefficient blocks; and, 
]5 b3) separately performing an IDCT by rows on results 

from step b2). 

6. The method of claim 5 wherein step b2) comprises the 
step of multiplying the downsized field coded DCT coeffi- 
cient blocks by [T4f>^, wherein [T4] is in the following 
form: 



[Tf] = 
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wherein [IT8 2 ] is in the following form: 
[73 r o 1 

"Ho ^. 



wherein [T8] is in the following form: 

50 

(00 (Of (02 (03 (04 t05 i06 (07 
(10 til ill (13 (14 (15 (16 (17 
(10 (21 (11 (23 (24 (25 (26 (27 
(30 (31 (32 (33 (34 (35 t36 (37 

[731 = 

(40 (41 (42 (43 (44 (45 (46 (47 
(50 (51 (52 t53 (54 (55 (56 (57 
(60 (61 (62 (63 (64 (65 (66 (67 
(70 (71 (72 (73 (74 (75 (76 (77 



and wherein 100, tOl. . - - t77 are eight point DCT basis 
vectors. 

4. The method of claim 3 wherein step a4) comprises the 
step of multiplying results from step a3) by [T4y^2, wherein 
[T4] is given by the following equation: 



rOO rOl rOl r03 

rlO rll rll r!3 

\T4] = 

1 J r20 r2l r22 r23 

j30 r3t r32 r33 

and wherein rOO, rOl, . . . r33 are four point DCT basis 
vectors. 

7. The method of claim 6 wherein step b3) comprises the 
step of multiplying results from step b2) by [T4]/^. 

8. The method of claim 7 wherein step a) comprises the 
following steps: 

55 al) dropping high order DCT coefficients from the frame 
coded DCT coefficient blocks to produced downsized 
frame coded DCT coefficient blocks; 
a2) reordering the downsized frame coded DCT coeffi- 
cient blocks; 

60 a3) converting the downsized frame coded DCT coeffi- 
cient blocks to resulting field coded DCT coefficient 
blocks and performing an IDCT by columns on the 
resulting field coded DCT coefficient blocks; and, 
a4) separately performing an IDCT by rows on results 

65 from step a3). 

9. The method of claim 8 wherein step a3) comprises the 
step of multiplying the resulting field coded DCT coefficient 
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blocks by [03], wherein [QSHIP^JDVBTSJ, wherein 
[IP4J is in the following form: 



[ip-h] = 



\pf o 

0 P4 T 



wherein [P4] is in the following form: 



{«] = 



tOO rt>t r02 r03 0 

rlO rlt rl2 r!3 0 

r20 r2l r22 r23 0 

r30 r3l r32 r33 

0 0 0 0 
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0 0 

0 0 

0 0 

0 0 



0 0 

0 0 

0 0 

0 0 



0 0 
0 0 



wherein rOO, rOl, . . . r33 are four point DCT basis vectors, 
wherein [Ty] is in the following form; 
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al) dropping high order DCT coefficients from the frame 
coded DCT coefficient blocks to produced downsized 
frame coded DCT coefficient blocks; 

a2) reordering the downsized frame coded DCT coeffi- 
cient blocks; 

a3) converting the downsized frame coded DCT coeffi- 
cient blocks to resulting field coded. DCT coefficient 
blocks and performing an IDCT by columns on the 
resulting field coded DCT coefficient blocks; and, 

a4) separately performing an IDCT by rows on results 
from step a3). 

12. The method of claim 11 wherein step c) comprises the 
following steps: 

cl) translating the motion vector to a reference address; 

c2) horizontally upsampling and downsampling the ref- 
erence pixel blocks based on the reference address; 

c3) for field prediction, vertically upsampling and down- 
sampling the horizontally upsampled and down- 
sampled reference pixel blocks; and, 

c4) for frame prediction, (i) vertically upsampling the 
horizontally upsampled and downsampled reference 
pixel blocks separately for fields a and b, (ii) if the 



tOO 0 (01 0 

(10 0 (11 0 

(20 0 12/ 0 

(30 0 (31 0 

(40 0 (41 0 

150 0 (51 0 

(60 0 (61 0 

(70 0 (71 0 

0 (00 0 tOl 

0 (tO 0 ill 

0 t20 0 (21 

0 (30 0 (31 

0 (40 0 (41 

0 (50 0 (51 

0 (60 0 (61 

0 (70 0 (71 



(02 0 (03 0 

(12 0 (13 0 

(22 0 (23 0 

1 32 0 (33 0 

(42 0 (43 0 

(52 0 (53 0 

(62 0 (63 0 

(72 0 (73 0 

0 (02 0 (03 

0 (12 0 (13 

0 t22 0 (23 

0 t32 0 /3J 

0 (42 0 f-tf 

0 (52 0 /JJ 

0 rf2 0 (63 

0 r72 0 (73 



(04 0 (05 Q 

(14 0 0 

(24 0 /2J 0 

(34 0 (35 0 

(44 0 /45 0 

(54 0 /JJ 0 

0 (65 0 

/74 0 (75 0 

0 /« 0 (05 

0 //<* 0 ;75 

0 (24 0 i25 

0 (34 0 fJ5 

0 /•# 0 (45 

0 (54 0 (55 

0 (64 0 (65 

0 (74 0 ;75 



f06 0 (07 0 

r/fi 0 ;/7 0 

(26 0 (27 0 

(36 0 (37 0 

r46 0 r47 0 

f 56 0 f57 0 

r66 0 (67 0 
f7<5 0/77 0 

0 (06 0 (07 

0 r/6 0 //7 

0 (26 0 r27 

0 (36 0 137 

0 rifi 0 (47 

0 fJfi 0 (57 

0 rtS<5 0 rfi7 

0 r7<5 0 ;77 



wherein [rT8 2 ] is in the following form: 
In? o 1 

■"Ho 



wherein [T8] is in the following form: 

rt» (01 (02 (03 (04 (05 (06 (07 

(10 (11 (12 (13 (14 (15 (16 (17 

(20 (21 (22 (23 (24 (25 (26 (27 

(30 (31 (32 (33 (34 (35 (36 (37 

1 ' (40 (41 (42 (43 (44 (45 (46 (47 

(50 (51 (52 (53 (54 (55 (56 (57 

(60 (61 (62 (63 (64 (65 (66 (67 

(70 (71 (72 (73 (74 (75 (76 (77 

and wherein tOO, t01, . . . 177 are eight point DCT basis 
vectors. 

10. The method of claim 9 wherein step a4) comprises the 
step of multiplying results from step a3) by \T4y^l. 

11. The method of claim 5 wherein step a) comprises the 
following steps: 



motion vectors are fractional, combining the separately 
45 upsampled a and b fields into a reference frame block, 
then performing a half pixel interpolation on the com- 
bined frame block, and (iii) separately vertically down- 
sampling the upsampled a and b fields. 

13. The method of claim 1 wherein step c) comprises the 
following steps: 

50 cl) translating the motion vector to a reference address; 
c2) horizontally upsampling and downsampling the ref- 
erence pixel blocks based on the reference address; 
c3) for field prediction, vertically upsampling and down- 
sampling the horizontally upsampled and down- 

55 sampled reference pixel blocks; and, pi c4) for frame 
prediction, (i) vertically upsampling the horizontally 
upsampled and downsampled reference pixel blocks 
separately for fields a and b, (ii) if the motion vectors 
are fractional, combining the separately upsampled a 
and b fields into a reference frame block, then perform- 

6Q ing a half pixel interpolation on the combined frame 
block, and (iii) separately vertically downsampling the 
upsampled a and b fields. 

14. The method of claim 1 further comprising the step of 
storing only the reconstructed field blocks as the reference 
pixel blocks. 

65 15. An apparatus for downconverting received frame and 
field coded DCT coefficient blocks to reconstructed pixel 
field blocks, the apparatus comprising: 
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an 1DCT module arranged to convert the received frame 
coded DCT coefficient blocks to converted field coded 
DCT coefficient blocks and to perform an IDCT on the 
converted field coded DCT coefficient blocks and on 
the received field coded DCT coefficient blocks in order 5 
to produce downconverted pixel field blocks; and, 

a motion compensator arranged to apply motion 
compensation, as appropriate, to the downconverted 
pixel field blocks in order to produce the reconstructed 
pixel field blocks. 10 

16. The apparatus of claim 15 wherein the received frame 
and field coded DCT coefficient blocks have motion vectors 
associated therewith, and wherein the motion compensator 
comprises: 

a selector arranged to select reference pixel blocks based 15 

upon the motion vectors; and, 
an adder arranged to add the reference pixel blocks to the 

pixel field blocks to form the reconstructed pixel field 

blocks. 

17. The apparatus of claim 16 wherein the selector 20 
comprises: 

an upsampler arranged to upsample the reference pixel 
blocks; and 

a downsampler arranged to downsample the upsampled 
reference pixel blocks. 25 

18. The apparatus of claim 16 wherein the selector 
comprises: 

a translator arranged to translate the motion vector to a 
reference address; 

30 

a horizontal upsampler and downsampler arranged to 
horizontally upsample and downsample the reference 
pixel blocks based on the reference address; 

a first vertical upsampler and downsampler arranged 
during field prediction to vertically upsample and 35 
downsample the horizontally upsampled and down- 
sampled reference pixel blocks when field coded DCT 
coefficient blocks are received; and, 

a second vertical upsampler and downsampler arranged 
during frame prediction to (i) vertically upsample the 40 
horizontally upsampled and downsampled reference 
pixel blocks separately for fields a and b, (ii) if the 
motion vectors are fractional, combine the separately 
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upsampled a and b fields into a reference frame block, 
then perform a half pixel interpolation on the combined 
frame block, and (iii) separately vertically downsample 
the upsampled a and b fields. 

19. The apparatus of claim 15 wherein the IDCT module 
comprises: 

a discarder arranged to discard high order DCT coeffi- 
cients from the frame coded DCT coefficient blocks to 
produce downsized frame coded DCT coefficient 
blocks; 

a rcorderer arranged to reorder the downsized frame 

coded DCT coefficient blocks; 
a column IDCT module arranged to perform an IDCT by 

columns on the reordered and downsized frame coded 

DCT coefficient blocks; and, 
a separate row IDCT module arranged to perform an 

IDCT by rows on results from the column IDCT 

module. 

20. The apparatus of claim 19 wherein the column IDCT 
module comprises a multiplier arranged to multiply the 
reordered and downsized frame coded DCT coefficiem 
blocks by [Q3], wherein [Q3]-CIP4 2 3[T / ][IT8 2 ], wherein 
[IP4 2 ] is in the following form: 

\P4 T 0 1 

"Ho « r . 

wherein [P4] is in the following form: 

rOO rOl r02 r03 0 0 0 0 

rIO rU r!2 rl3 0 0 0 0 

r20 r21 r22 t23 0 0 0 0 

r30 r3l r32 r33 0 0 0 0 / 
\P4] = / VT 

J 0 0 0 00000/ 

0 0 0 00000 

0 0 0 00000 

,0 0 0 0000 0, 

wherein rOO, rOl, . . . r33 are four point DCT basis vectors, 
wherein [Ty] is in the following form: 



tOO 0 (01 0 102 0 103 0 

tJO 0 ill 0 f/2 0 (13 0 

i20 0 ill 0 i22 0 i23 0 

t30 0 [31 0 (32 0 i33 0 

(40 0 (41 0 142 0 143 0 

150 0 t5t 0 (52 0 i53 0 

i60 0 [61 0 i62 0 i63 0 

i70 0 (71 0 t72 0 i73 0 

0 m 0 tOi 0 (02 0 i03 

0 1/0 0 ili 0 f/2 0 [13 

0 [20 0 ai 0 [22 0 (23 

0 t30 0 (31 0 (32 0 (33 

0 [40 0 fti 0 t42 0 (43 

0 (50 0 i5l 0 (52 0 (53 

0 (60 0 i(S/ 0 (62 0 (63 

0 t70 0 (71 0 (72 0 (73 



104 0 (05 0 (06 0 107 0 

ii4 0 r/J 0 r!6 0 tl7 0 

0.4 0 (25 0 t26 0 t27 0 

(34 0 (35 0 (36 0 (37 0 

(44 0 (45 0 [46 0 (47 0 

(54 0 i55 0 (56 0 i57 0 

(64 0 [65 0 (66 0 [67 0 

(74 0 [75 0 (76 0 (77 0 

0 (04 0 (05 Q (06 0 (07 

0 (14 0 (15 0 (16 0 (17 

0 (24 0 (25 0 i26 0 (27 

0 (34 0 (35 0 (36 0 (37 

0 (44 0 (45 0 (46 0 (47 

0 (54 0 (55 0 (56 0 (57 

0 (64 0 (65 0 (66 0 (67 

0 (74 0 (75 0 (76 0 (77 
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wherein [ITSJ is in the following form: 
7^ 0 I 
o re 7 " 



1/78:1 = 



[73] = 



wherein [T8] is in the following form: 

tOO (01 (02 (03 (04 (05 (06 (07 

tlO ill (12 (13 (14 iI5 tl6 (17 

(20 i2l ill (23 (24 i25 i26 0.7 

(30 (31 i32 t33 (34 [35 t36 [37 

140 t4l 142 r43 (44 (45 (46 t47 

t50 t5l i52 t53 (54 (55 (56 (57 

t60 (61 (62 (63 164 (65 (66 (67 

170 (71 (72 (73 i74 (75 (76 (77 



and wherein tOO, tOl, . . . t77 are eight point DCT basis 
vectors. 

21. The apparatus of claim 20 wherein the multiplier is a 
first multiplier, and wherein the row IDCT module com- 
prises a second multiplier arranged to multiply results from 
the first multiplier by [T4]/^I, and wherein [T4] is given by . 
the following equation: 

f rOO rOl r02 r03 



20 



[T4] = 



rlO rll r!2 rl3 
r20 rll r22 r23 
r30 r31 r32 r33 



22. The apparatus of claim 15 wherein the IDCT module 
comprises: 

a discarder arranged to discard high order DCT coeffi- 
cients from the field coded DCT coefficient blocks to 
produce downsized field coded DCT coefficient blocks; 

a column IDCT module arranged to perform an IDCT by 
columns on the downsized field coded DCT coefficient 
blocks; and, 

a separate row IDCT module arranged to perform an 
IDCT by rows on results from the column IDCT 
module. 

23. The apparatus of claim 22 wherein the column IDCT 
module comprises a multiplier arranged to multiply the 
downsized field coded DCT coefficient blocks by \JAfHl t 
wherein [T4] is given by the following equation: 



28 



[74] = 



rOO tOl r02 t03 

rlO rll r!2 r!3 

r20 rll r22 r23 

r30 r3l r32 r33 



and wherein rOO, rOl, . . . r33 are four point DCT basis 
vectors. 

24. The apparatus of claim 23 wherein the multiplier is a 
first multiplier, and wherein the row IDCT module com- 
prises a second multiplier arranged to multiply results from 
the first multiplier by [T4]A^2. 

25. The apparatus of claim 24 wherein the discarder is a 
first discarder, wherein the column IDCT module is a first 
column IDCT module, wherein the row IDCT module is first 
row IDCT module, and wherein the IDCT module further 
comprises: 

a second discarder arranged to discard high order hori- 
zontal DCT coefficients from the frame coded DCT 
coefficient blocks to produce downsized frame coded 
DCT coefficient blocks; 
a reorderer arranged to reorder the downsized frame 

coded DCT coefficient blocks; 
a second column IDCT module arranged to perform an 
IDCT by columns on the reordered and downsized 
frame coded DCT coefficient blocks; and, 
25 a second row IDCT module arranged to perform an IDCT 
by rows on results from the second column IDCT 
module. 

26. The apparatus of claim 25 wherein the second column 
IDCT module comprises a third multiplier arranged to 
multiply the reordered and downsized frame coded DCT 

30 coefficient blocks by [Q3], wherein [QSH^ftTJITSj], 
wherein [IP4 2 ] is in the following form: 

P4 T i 



E/W z ] = 



0 P4 T 



35 



wherein [P4] is in the following form: 
rOO rOl t02 r03 0 0 0 0 



[W] = 



rlQ rll r!2 r!3 0 

r20 r2l r22 r23 0 

r30 r3I r32 r33 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 



0 0 

0 0 

0 0 

0 0 

0 0 

0 0 

0 0 



wherein rOO, rOl, . . . r33 arc four point DCT basis vectors, 
wherein [T^] is in the following form: 



r 100 o (01 o 

do 0 til 0 

(20 0 (21 0 

(30 0 (31 0 

140 0 (41 0 

(50 0 (51 0 

(60 0 (61 0 

[(70 0 (71 0 

ro (00 o (Oi 

0 (10 0 /// 

o t2o o ai 

0 (30 0 (31 

0 (40 0 (41 

0 (SO 0 (51 

0 (60 0 (61 

[O (70 0 (71 



(02 0 (03 0 

(12 0 (13 0 

(27 0 (23 0 

(32 0 1 33 0 

(42 0 (43 0 

(52 0 (53 0 

(62 0 (63 0 

(72 0 (73 0 

0 (02 0 (03 

0 (12 0 (13 

0 (22 0 (23 

0 (32 0 (33 

0 (42 0 (43 

0 (52 0 (53 

0 (62 0 (63 

0 (72 0 (73 



(04 0 (05 0 

(14 0 (15 0 

(24 0 (25 0 

(34 0 (35 0 

(44 0 (45 0 

(54 0 (55 0 

(64 0 t65 0 

(74 0 (75 0 

0 (04 0 (05 

0 (14 0 (IS 

0 (24 0 (25 

0 (34 0 (35 

0 (44 0 (45 

0 (54 0 (55 

0 (64 0 (65 

0 (74 0 (75 



(06 0 (07 0 

t!6 0 (17 0 

(26 0 (27 0 

(36 0 t37 0 

(46 0 (47 0 

(56 0 (57 0 

(66 0 (67 0 

(76 0 i77 0 

0 (06 0 (07 

0 (16 0 (17 

0 [26 0 (27 

0 (36 0 (37 

0 (46 0 (47 

0 (56 0 (57 

0 rtS6 0 (67 

0 r76 0 r77 
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wherein [ITSJ is in the following form: 



30 



[/7S I J = 



TP 0 

0 re 7 " 



wherein [T8] is in the following form: 



[T8] = 



t00 tOl t02 103 (04 i05 t06 t07 

110 til tJ2 tl3 1 14 tI5 (16 ill 

i20 tU t22 t23 i24 t25 t26 t27 

t30 t31 i32 t33 i34 t3S t36 i37 

t40 i4l t42 t43 (44 t45 (46 t47 

{50 t5l 152 t53 (54 155 t56 157 

{60 (61 i62 r63 t64 t65 166 t67 

170 t71 {72 t73 t74 {75 {76 {77 



15 



20 



25 



and wherein tOO, tOl, . . . t77 are eight point DCT basis 
vectors. 

27. The apparatus of claim 26 wherein the second row 
I DCT module comprises a fourth multiplier arranged to 
multiply results from the third multiplier by [T4]/^2~. 

28. The apparatus of claim 22 wherein the discarder is a 
first discarder, wherein the column IDCT module is a first 
column IDCT module, wherein the row IDCT module is first 
row IDCT module, and wherein the IDCT module further 
comprises: 

a second discarder arranged to discard high order DCT 30 
coefficients from the frame coded DCT coefficient 
blocks to produce downsized frame coded DCT coef- 
ficient blocks; 

a reorderer arranged to reorder the downsized frame 

coded DCT coefficient blocks; 35 
a second column IDCT module arranged to performing an 

IDCT by columns on the reordered and downsized 

frame coded DCT coefficient blocks; and, 
a second row IDCT module arranged to performing an 

IDCT by rows on results from the second column IDCT 40 

module. 

29. The apparatus of claim 15 wherein the motion com- 
pensator comprises: 

a selector arranged to select reference pixel blocks based 

upon the motion vectors; 
an adder arranged to add the reference pixel blocks to the 

pixel field blocks to form the reconstructed pixel field 

blocks; and, 

a reference picture memory arranged to store only the 
reconstructed pixel field blocks as the reference pixel 
blocks. 

30. An apparatus for downconverting received frame and 
field coded DCT coefficient blocks to downconverted pixel 
field blocks, the apparatus comprising: 

a first IDCT module arranged to convert the received 
frame coded DCT coefficient blocks to converted field 
coded DCT coefficient blocks and to perform a down- 
converting IDCT on the converted field coded DCT 
coefficient blocks in order to produce first downcon- 
verted pixel field blocks; and, 

a second IDCT module arranged to directly perform a 
downconverting IDCT on the received field coded DCT 
coefficient blocks in order to produce second down- 
converted pixel field blocks. 

31. The apparatus of claim 30 wherein the received frame 
and field coded DCT coefficient blocks have motion vectors 



45 



50 



55 



60 



associated therewith, and wherein the apparatus further 
comprises: 

a selector arranged to select reference pixel blocks from 
a reference picture memory based upon the motion 
vectors; and, 

an adder arranged to add the reference pixel blocks to the 
first and second downconverted pixel field blocks in 
order to form reconstructed pixel field blocks. 

32. The apparatus of claim 31 wherein the selector 
comprises: 

an upsampler arranged to upsample the reference pixel 
blocks; and, 

a downsampler arranged to down sample the upsampled 
reference pixel blocks. 

33. The apparatus of claim 31 wherein the selector 
comprises: 

a translator arranged to translate the motion vector to a 
reference address; 

a horizontal upsampler and downsampler arranged to 
horizontally upsample and downsample the reference 
pixel blocks based on the reference address; 

a first vertical upsampler and downsampler arranged 
during field prediction to vertically upsample and 
downsample the horizontally upsampled and down- 
sampled reference pixel blocks when field coded DCT 
coefficient blocks are received; and, 

a first vertical upsampler and downsampler arranged 
during frame prediction to (i) vertically upsample the 
horizontally upsampled and downsampled reference 
pixel blocks separately for fields a and b, (ii) if the 
motion vectors are fractional, combine the separately 
upsampled a and b fields into a reference frame block, 
then perform a half pixel interpolation on the combined 
frame block, and (iii) separately vertically downsample 
the upsampled a and b fields. 

34. The apparatus of claim 30 wherein the first IDCT 
module comprise: 

a discarder arranged to discard high order horizontal DCT 
coefficients from the frame coded DCT coefficient 
blocks to produce downsized frame coded DCT coef- 
ficient blocks; 

a reorderer arranged lo reorder the downsized frame 

coded DCT coefficient blocks; 
a column IDCT module arranged to perform an IDCT by 

columns on the reordered and downsized frame coded 

DCT coefficient blocks; and, 
a separate row IDCT module arranged to perform an 

IDCT by rows on results from the column IDCT 

module. 

35. The apparatus of claim 34 wherein the column IDCT 
module comprises a multiplier arranged to multiply the 
reordered and downsized frame coded DCT coefficient 
blocks by [Q3], wherein [Q3]-[IP4 2 ][T / ][IT8 2 ], wherein 
[IP4 2 ] is in the following form: 



\P4 T 



65 



0 
P4 r 
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wherein [P4] is in the following form: 



[P4] = 



rOO rOl r02 rOJ 0 0 

rlO rll rl2 rl3 0 0 

r20 rll t22 t23 0 0 

r30 r3l r32 r33 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 
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wherein rOO, rOl, . . . r33 are four point DCT basis vectors, 
wherein [T^] is in the following form: 



[T4]: 



32 



(00 rOl (02 r03 

rlO rll r!2 rl3 

r20 r2l r22 r23 

r30 r3l r32 r33 



37. The apparatus of claim 30 wherein the second IDCT 
module comprises: 

a discarder arranged to discard high order DCT coeffi- 
cients from the field coded DCT coefficient blocks to 
produce downsized field coded DCT coefficient blocks; 

a column IDCT module arranged to perform an IDCT by 
columns on the downsized field coded DCT coefficient 
blocks; and, 



too 0 


rOl 
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t02 
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t03 
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104 


0 
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(06 0 


(07 0 


(10 0 


til 


0 


rl2 
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t!3 
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(14 
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(15 
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(16 0 


(17 0 


t20 0 


ai 


0 


i22 


0 


t23 
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(24 
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(25 
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(26 0 


(27 0 


t30 0 


t3l 
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t32 
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(33 
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(34 
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(35 
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(36 0 


(37 0 


(40 0 


(41 


0 


142 
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143 
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144 
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(45 
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(46 0 


(47 0 


(50 0 


(51 
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(52 
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i53 
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(54 
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(55 
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(56 0 


(57 0 


(60 0 


(61 


0 


i62 


0 


163 


0 


t64 
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(65 


0 


(66 0 


(67 0 


(70 0 


171 


0 


t72 


0 


(73 


0 


174 
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(75 
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(76 0 


(77 0 


0 too 
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101 


0 


m 


0 (03 


0 


104 


0 


(05 


0 (06 


0 (07 


0 (10 


0 


(11 


0 


t!2 


0 tl3 


0 


t/4 


0 


(15 


0 t!6 


0 (17 


0 (20 


0 


121 


0 


t22 


0 r23 


0 


t24 


0 


as 


0 (26 


0 (27 


0 (30 


0 


i3l 


0 


t32 


0 t33 


0 


t34 
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(35 


0 (36 


0 (37 


0 t40 


0 


14! 


0 


t42 


0 143 


0 


t44 


0 


(45 


0 (46 


0 (47 


0 (50 


0 


i5l 


0 


t52 


0 (53 


0 


(54 


0 


(55 


0 (56 


0 (57 


0 i60 


0 


(61 


0 


i62 


0 i63 


0 


(64 


0 


(65 


0 (66 


0 (67 


0 170 


0 


i7f 


0 


172 


0 r?3 


0 


(74 


0 


(75 


0 (76 


0 (77 



wherein [rT8 2 ] is in the following form: 
T8 r 0 

[ 0 7^ 

wherein [T8] is in the following form: 



[75] = 



(00 (01 

tio tii 

(20 a i 

(30 i3l 

(40 141 

(50 (51 

(60 (61 

(70 t7I 



(02 (03 

(12 (13 

(22 (23 

(32 (33 

(42 (43 

(52 (53 

(62 (63 

(72 (73 



m (05 

(14 (15 

(24 (25 

(34 i35 

:44 (45 

(54 (55 

(64 (65 

(74 (75 



(06 (07 

(16 (17 

(26 (27 

(36 (37 

(46 (47 

(56 (57 

(66 (67 

(76 (77 



T8 r 0 
0 T8 T 



a separate row IDCT module arranged to perform an 
IDCT by rows on results from the column IDCT 
module. 

38. The apparatus of claim 37 wherein the column IDCT 
module comprises a multiplier arranged to multiply the 
downsized field coded DCT coefScient blocks by fT4/>/2, 
wherein [T4] is given by the following equation: 



45 



50 



55 



and wherein tOO, tOl, . . . 177 are eight point DCT basis 
vectors. 60 



36. The apparatus of claim 35 wherein the multiplier is a 
first multiplier, and wherein the row IDCT module com- 
prises a second multiplier arranged to multiply results from 65 
the first multiplier by [T4]/^Z, and wherein [T4] is given by 
the following equation: 



(00 rOt f02 (03 

rlO rll rl2 r/3 

r20 r2l r22 r23 

r30 r3l r32 r33 



and wherein rOO, rOl, . . . r33 are four point DCT basis 
vectors. 

39. The apparatus of claim 38 wherein the multiplier is a 
first multiplier, and wherein the row IDCT module com- 
prises a second multiplier arranged to multiply results from 
the first multiplier by \T4]p2. 

40. The apparatus of claim 39 wherein the discarder is a 
first discarder, wherein the column IDCT module is a first 
column IDCT module, wherein the row IDCT module is first 
row IDCT module, and wherein the first IDCT module 
further comprises: 

a second discarder arranged to discard high order hori- 
zontal DCT coefficients from the frame coded DCT 
coefficient blocks to produce downsized frame coded 
DCT coefficient blocks; 

a reorderer arranged to reorder the downsized frame 
coded DCT coefficient blocks; 

a second column IDCT module arranged to perform an 
IDCT by columns on the reordered and downsized 
frame coded DCT coefficient blocks; and, 
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a second row IDCT module arranged to perform an IDCT 
by rows on results from the second column IDCT 
module. 

41. The apparatus of claim 40 wherein the second column 
IDCT module comprises a third multiplier arranged to 
multiply the reordered and downsized frame coded DCT 
coefficient blocks by [03], wherein [Q3HIP4 2 ]P>][rT8 2 ], 
wherein [IP4 2 ] is in the following form: 



P4 T 0 
0 P4 T 
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wherein [T8]] is in the following form: 



10 



[78]- 



(00 iOI 

ito til 

(20 ill 
(30 i3J 
(40 HI 
t50 i51 
(60 (61 
(70 i71 



(02 t03 

(12 i/3 

t22 t23 

x32 iS3 

t42 t43 

152 i53 

162 t63 

172 173 



i04 rOS 

tl4 tl5 

t24 as 

t34 t35 

t44 i4S 

(54 155 

t64 t6S 

t74 175 



(06 (07 

1 16 tI7 

u6 t27 

i26 i37 

146 (47 

156 t57 

166 [67 

176 (77 



wherein [P4] is in the following form: 



[P4\ = 



rOO rOf 

NO rlt 

r20 r2l 

r30 r3l 

0 0 

0 0 

0 0 

0 0 



r02 r03 

r!2 r\3 

r22 r23 

r32 r33 

0 0 



0 0 0 0 
0 0 0 0 
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wherein rOO, rOl, . . . r33 are four point DCT basis vectors, 
wherein [T^] is in the following form: 



and wherein tOO, tOl, . . . t77 are eight point DCT basis 
vectors. 

42. The apparatus of claim 41 Wherein the second row 
IDCT module comprises a fourth multiplier arranged to 
multiply results from the third multiplier by [T4Y*2. 

43. The apparatus of claim 37 wherein the discarder is a 
first discarder, wherein the column IDCT module is a first 
column IDCT module, wherein the row IDCT module is a 
first row IDCT module, and wherein the IDCT module 
further comprises: 

a second discarder arranged to discard high order hori- 
zontal DCT coefficients from the frame coded DCT 
coefficient blocks to produce downsized frame coded 
DCT coefficient blocks; 

a reorderer arranged to reorder the downsized frame 
coded DCT coefficient blocks; 
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a second column IDCT module arranged to perform an 
IDCT by columns on the reordered and downsized 
frame coded DCT coefficient blocks; and, 

a second row IDCT module arranged to perform an IDCT 
by rows on results from the second column IDCT 
module. 

44. The apparatus of claim 30 further comprising: 

a selector arranged to select reference pixel blocks based 

upon the motion vectors; 
an adder arranged to add the reference pixel blocks to the 

pixel field blocks to form reconstructed pixel field 

blocks; and, 

a reference picture memory arranged to store only the 
reconstructed pixel field blocks as the reference pixel 
blocks. 
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